A Review Corpus for Argumentation Analysis

Authors

  • Henning Wachsmuth
  • Martin Trenkmann
  • Benno Stein
  • Gregor Engels
  • Tsvetomira Palakarska
Abstract

The analysis of user reviews has become critical in research and industry, as user reviews increasingly affect the reputation of products and services. Many review texts comprise an involved argumentation with facts and opinions on different product features or aspects. Therefore, classifying sentiment polarity does not suffice to capture a review’s impact. We claim that an argumentation analysis is needed, including opinion summarization, sentiment score prediction, and others. Since language resources to drive such research are missing, we have designed the ArguAna TripAdvisor corpus, which compiles 2,100 manually annotated hotel reviews balanced with respect to the reviews’ sentiment scores. Each review text is segmented into facts, positive opinions, and negative opinions, and all hotel aspects and amenities are marked. In this paper, we present the design and a first study of the corpus. We reveal patterns of local sentiment that correlate with sentiment scores, thereby defining a promising starting point for an effective argumentation analysis.
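
To make the annotation scheme in the abstract concrete, the following is a minimal sketch of how one annotated review and its sequence of local sentiment might be represented. It is not the corpus's actual file format or the authors' code: the class names `Segment` and `Review`, the function `local_sentiment_flow`, the 1-5 score range, and the example review text are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    text: str
    label: str  # one of "fact", "positive", "negative" (hypothetical encoding)


@dataclass
class Review:
    score: int  # overall sentiment score; the 1-5 range is an assumption
    segments: List[Segment]


def local_sentiment_flow(review: Review) -> List[str]:
    """Return the sequence of segment labels, merging consecutive repeats."""
    flow: List[str] = []
    for segment in review.segments:
        if not flow or flow[-1] != segment.label:
            flow.append(segment.label)
    return flow


# Invented example review, not taken from the corpus.
review = Review(
    score=4,
    segments=[
        Segment("The room was spotless.", "positive"),
        Segment("Staff were friendly.", "positive"),
        Segment("We stayed for three nights.", "fact"),
        Segment("Breakfast was overpriced.", "negative"),
        Segment("Overall we would come back.", "positive"),
    ],
)

flow = local_sentiment_flow(review)
print(flow)
```

Flows of this kind, grouped by overall score, are one simple way to look for the sort of local sentiment patterns the abstract mentions.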

Similar resources

Argumentation for Scientific Claims in a Biomedical Research Article

This paper provides an analysis of some argumentation in a biomedical genetics research article as a step towards developing a corpus of articles annotated to support research on argumentation. We present a specification of several argumentation schemes and inter-argument relationships to be annotated.

From Discourse Analysis to Argumentation Schemes and Back: Relations and Differences

In argumentation theory, argumentation schemes are abstract argument forms expressed in natural language, commonly used in everyday conversational argumentation. In computational linguistics, discourse analysis has been conducted to identify the discourse structure of connected text, i.e. the nature of the discourse relationships between sentences. In this paper, we propose to couple these two...

Identifying Argumentation Schemes in Genetics Research Articles

This paper presents preliminary work on identification of argumentation schemes, i.e., identifying premises, conclusion and name of argumentation scheme, in arguments for scientific claims in genetics research articles. The goal is to develop annotation guidelines for creating corpora for argumentation mining research. This paper gives the specification of ten semantically distinct argumentatio...

Towards Creation of a Corpus for Argumentation Mining the Biomedical Genetics Research Literature

Argumentation mining involves automatically identifying the premises, conclusion, and type of each argument as well as relationships between pairs of arguments in a document. We describe our plan to create a corpus from the biomedical genetics research literature, annotated to support argumentation mining research. We discuss the argumentation elements to be annotated, theoretical challenges, a...

Argumentative meanings and their stylistic configurations in clinical research publications

The paper reports on the results of an exploratory study into the topical organisation and stylistic features of argumentation in a corpus of ophthalmic clinical research papers. The study responds to the need for systematised and generalisable argumentation models in knowledge-intensive fields. We present here a schematised superstructure of the arguments from the corpus, charting the configura...

Publication date: 2014